Automatic Discovery of Contextual Factors Describing Phonological Variation

نویسندگان

  • Francine R. Chen
  • Jeff Shrager
چکیده

In this paper we describe a method for automatically discovering subsets of contextual factors which, taken together, axe useful for predicting the realizations, or pronunciations, of English words for continuous speech recognition. A decision tree is used for organizing contextual descriptions of phonological variation. This representation enables us to categorize different realizations according to the context in which they appear in the corpus. In addition, this organization permits us to consider simplifications such as pruning and branch clustering, leading to parsimonious descriptions that better predict allophones in these contexts. We created trees to examine the working assumption that preceding phoneme and following phoneme provide important contexts, as exemplified by the use of triphones in hidden Maxkov models; our results were in general accordance with the assumption. However, we found that other contexts also play a significant role in phoneme realizations. Introduction-Context Sensitivity in Realizations Phonologists claim that the context in which a phoneme occurs leads to consistent differences in how it is pronounced. For example, one phonological rule may state that the phoneme/ t / i s often flapped when it is preceded and followed by a vocalic (as in "butter"). The construction of these rules is typically an intricate process of theory formation, rule construction and then validation or disconfirmation of these rules. In this paper we describe an approach to partially automate rule construction, allowing for a larger number of examples to be examined and checked for consistencies. Our examples come from comparing transcriptions of spoken speech with a dictionary representation of the words spoken. We shall call the dictionary pronunciation symbols phonemes and define the realizations, or allophones, of a phoneme to be the set of transcription symbols corresponding to that phoneme. For example, pronunciations of the phoneme / t / inc lude the released, flapped, and unreleased realizations as characteristically occur in "tap", "butter", and "pat new", respectively. In addition, we shall refer to a context as having values. For example, the context stress has values primary, secondary, and unstressed. The approach is based on automatically forming and simplifying decision trees. Decision trees have been used for both understanding and classification of data (Henrichon and Fu, 1969). In our application, they provide a way of using context to organize the various realizations of a phoneme. The probability of a realization varies with the context in which the phoneme occurs. Contexts which have similar realization distributions are grouped together. The decision tree thus provides a method for representing the partitions of allophones with dissimilar probabilities, based on context. Decision trees can be formed automatically (Breiman et al., 1984; Quinlan, 1986) and converted to rules (Quinlan, 1987). The problem addressed here is to construct decision trees that are appropriate for use in the construction of pronunciation rules and for predicting realizations in context. In addition, an important part of the tree induction method is the discovery of appropriate descriptive categories for the formation of such trees. These categories often resemble theoretical categories, such as vocalic or plosive, and define the organization of the tree.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Contextually-Based Data-Derived Pronunciation Networks for Automatic Speech Recognition

The context in which a phoneme occurs leads to consistent differences in how it is pronounced. Phonologists employ a variety of contextual descriptors, based on factors such as stress and syllable boundaries, to explain phonological variation. However, in developing pronunciation networks for speech recognition systems, little explicit use is made of context other than the use of whole word mod...

متن کامل

Contextual effects on voicing profiles of German and Mandarin consonants

In this paper we present a study of the voicing profiles of consonants in Mandarin Chinese and German. The voicing profile is defined as the frame-by-frame voicing status of a speech sound in continuous speech. We are particularly interested in discrepancies between the phonological voicing status of a speech sound and its actual phonetic realization in connected speech. We further examine the ...

متن کامل

Integrating contextual phonological rules in a large vocabulary decoder

This paper presents an approach to the integratation of contextual phonological rules in the beam-search algorithm of a large vocabulary speech recognition system. The main interest of contextual transcription rules is that they implement constraints on pronunciations sequences which complement the bigram constraints on word sequences. As such, they should help avoiding acoustic confusions and ...

متن کامل

Systematic patterning in phonologically-motivated orthographic variation∗

Social media features a wide range of nonstandard spellings, many of which appear inspired by phonological variation. However, the nature of the connection between variation across the spoken and written modalities remains poorly understood. Are phonological variables transferred to writing on the level of graphemes, or is the larger system of contextual patterning also transferred? This paper ...

متن کامل

Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989